Method of digitally recording compressed video and audio data

ABSTRACT

A data recording system for recording in a CD-ROM or similar recording medium a data file containing data such as compressed audio and video data and an index file containing index data for searching the individual data of the data file. The data stored in the data file are recorded in a CD-ROM in three hierarchical stages, i.e., titles, scenes and shots or chapters, scenes and clips. The index file has a hierarchical format associated with that of the data file in order to manage the data of the data file in consecutive stages.

This is a continuation of application Ser. No. 08/225,457, filed Apr. 6, 1994 now abandoned, which is a divisional of application Ser. No. 08/048,207, filed Apr. 20, 1993, which is a Continuation of application Ser. No. 07/603,054, filed Oct. 25, 1990.

BACKGROUND OF THE INVENTION

The present invention relates to a data recording system for recording in a CD-ROM or similar recording medium a data file containing data such as compressed audio and video data and an index file containing index data for searching the individual data of the data file. More particularly, the present invention is concerned with the formats of the data and index files. Equipment of the type using optical disks has been developed in a variety of forms. Among the optical disks, digital audio disks in the form of compact disks (CD) are predominant in the audio disk market over traditional LP disks or similar grooved disks due to the non-contact playback and faithful sound reproduction capabilities. Today, extended applications of such CDs to personal computers and other various data processing equipment as mass storage are attempted to take advantage of the extremely great storage capacity and the ease of handling and replacement. For example, CD-ROMs, CD-Is (Interactive) and CD-ROM/XAs (Extended Architecture) are the recent achievements. Such a latest type of CD is capable of recording not only text, graphics and other still pictures but also moving pictures, sound and various kinds of codes in combination, and reproducing the individual data in an interactive fashion.

It has been customary to compress audio data and video data including still and moving pictures when it is desired to record them together in the above-described type of recording medium. The compression is successful in promoting rapid read-out of the individual data at the time of playback. Index data associated with the individual data are recorded in the medium together with the video data, so that the compressed audio and video data may be searched to read out desired data. A prerequisite is, therefore, that a data file containing the audio and video data and an index file containing the index data each be provided with a particular format that allows the data to be readily prepared, edited, recorded, reproduced, and searched at the time of reproduction. Formats and data recording systems which meet such a requirement have not been reported yet. The problem with CD-Is and CD-ROM/XAs adopting a sector interleave system which uses the subcode of CDs is that they cannot record audio and video data efficiently and cannot easily synchronize the two different kinds of data in the event of playback.

SUMMARY OF THE INVENTION

It is therefore an object of the present invention to provide a data recording system capable of easily recording in a recording medium a data file which contains compressed audio and video data and an index file containing index data for searching the data file, and allowing desired data to be searched for with ease.

It is another object of the present invention to provide a data format of a data file containing compressed audio and video data to be recorded in a recording medium, and a data format of an index file containing index data for searching the individual recorded data.

It is another object of the present invention to provide a generally improved data recording system.

In accordance with the present invention, in a data recording system for recording in a recording medium a data file containing data including compressed audio data and compressed video data, and an index file containing index data for searching the data of the data file, the data contained in the data file have a hierarchical format, and the index file has a hierarchical format associated with that of the data file. The index file contains addresses on the basis of at least a minimum access unit of data.

BRIEF DESCRIPTION OF THE DRAWINGS

The above and other objects, features and advantages of the present invention will become more apparent from the following detailed description taken with the accompanying drawings in which:

FIG. 1 shows a method of compressing audio and image data frame by frame;

FIG. 2 shows a format of a data file representative of an embodiment of the data recording system in accordance with the present invention;

FIG. 3 shows a specific arrangement of a data type included in the format of FIG. 2;

FIG. 4 shows a specific format of audio data of FIG. 1;

FIG. 5 shows a specific format of video data of FIG. 1;

FIG. 6 shows a format of an index file to be recorded together with the data file of FIG. 2;

FIG. 7 shows a format of a data file representative of an alternative embodiment of the present invention;

FIG. 8 shows dummy data included in the format of FIG. 7;

FIG. 9 shows a specific format of video data included in the format of FIG. 7;

FIG. 10 shows a format of an index file to be recorded together with the data file of FIG. 7, and a specific arrangement of a chapter index thereof; and

FIG. 11 shows a specific format of a root index included in the index file.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

In illustrative embodiments of the present invention which will be described, data that may be recorded in a recording medium include various types of data such as text data in addition to audio and video data. These data can be handled in the same manner with no regard to their types by having their types designated. The embodiments, therefore, will concentrate on audio and video data that are considered most relevant thereto. While the recording medium is available in various forms such as a disk and a tape, the embodiments will be described in relation to a CD-ROM by way of example. Let it be assumed that the recording medium stores a data file containing compressed audio and video data and an index file containing index data for facilitating the search of the individual data of the data file.

A reference will be made to FIG. 1 for describing a procedure for compressing video data representative of a still or a moving picture and audio data. As shown, video data 101 is made up of consecutive frames or pictures V₁, V₂, V₃ and so on, while audio data 201 is constituted by sound A associated with the individual frames V₁, V₂, V₃ and so on of the video data 101. The sound A is not divided into frames since it has customarily not involved the concept of "frame". In the figure, the sound A is divided into frames in association with the pictures V₁, V₂, V₃ and so on for convenience's sake, whereby audio data 202 made up of frames or sounds A₁, A₂, A₃ and so on is generated. Such video data 101 and audio data 202 are compressed frame by frame to produce compressed video data 102 and compressed audio data 203. Specifically, the compressed video data 102 has compressed pictures V'₁, V'₂, V'₃ and so on and areas where no data exists as indicated by hatching, while the compressed audio data 203 has compressed sounds A'₁, A'₂, A'₃ and so on and areas where no data exists as also indicated by hatching. Subsequently, the hatched areas with no data are omitted from the compressed video and audio data 102 and 203. The resulting compressed video and audio data are combined on a frame basis to produce compressed data 300. The compressed data 300 is recorded in a medium which is implemented as a CD-ROM.

Hereinafter will be described illustrative embodiments of the present invention which record in a CD-ROM a data file including the compressed data 300, i.e., the compressed audio and video data and an index file including index data adapted to search the data 300.

A data file particular to a first embodiment of the present invention will be described first. As shown in FIG. 2, the data file has one or more titles each being constituted by one or more scenes. Each scene has one or more shots, each of which in turn has a plurality of frames and a dummy area. In each shot, each frame is headed by a start code to be distinguished from the others. Provided at the trailing end of the shot, the dummy area has dummy data therein so that the leading end of the shot coincides with the leading end of a block of the CD-ROM. Each frame contains audio and video data in the form of combinations of data types, data lengths and data. FIG. 3 shows a specific format representative of a data type. In FIG. 3, the data type is represented by 8-bit (one byte) data b₇ to b₀, and the kind of data is represented by the bits b₇ to b₄, for example. Specifically, the audio and video data may be represented by b₇ b₆ b₅ b₄ = 0010 and b₇ b₆ b₅ b₄ = 0000, respectively. The four bits b₇, b₆, b₅ and b₄ are followed by three spare bits b₃ to b₁ and a link bit b₀ which indicates the continuity of the frame. For example, the link bit b₀ may be "0" if data belonging to the same frame ends there or "1" if otherwise. The data length is representative of the number of bytes of data.
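
The data type byte of FIG. 3 can be illustrated with a short sketch. The bit positions and the two kind codes are taken from the description above; the function and constant names are hypothetical and are introduced only for illustration.

```python
# Minimal sketch of the one-byte data type described for FIG. 3.
# Names are hypothetical; only the bit layout and the two kind codes
# (audio = 0010, video = 0000) come from the description.

KIND_VIDEO = 0b0000  # bits b7..b4 for video data
KIND_AUDIO = 0b0010  # bits b7..b4 for audio data


def pack_data_type(kind, link):
    """Build a data type byte: kind in b7..b4, spare bits b3..b1 left 0, link in b0."""
    if not 0 <= kind <= 0b1111:
        raise ValueError("kind must fit in four bits")
    return (kind << 4) | (1 if link else 0)


def unpack_data_type(byte):
    """Return (kind, link) from a data type byte; the spare bits are ignored."""
    kind = (byte >> 4) & 0b1111
    link = bool(byte & 0b1)  # True: more data of the same frame follows
    return kind, link
```

Under this layout, audio data that is followed by further data of the same frame would carry the byte 0x21, and video data that closes the frame would carry 0x00.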

The audio data 201 of FIG. 1 has a scene-oriented format and, as shown in FIG. 4, is constituted by a data type, a title, a data length, and data which are arranged in this order. The data type identifies audio data. The scene-by-scene format of the audio data is adopted since the amount of audio data is generally smaller than that of video data. The audio data has a slightly longer length than the actual scene so that it may be recorded in a CD-ROM together with the video data while alternating with the latter. The audio data has a data length of 8,000 bytes per second or approximately 267 bytes per frame. On the other hand, the video data has a shot-by-shot format. Specifically, as shown in FIG. 5, the video data 101 of FIG. 1 has a data type which is a data identification signal, a title given to the shot, the number of frames included in the shot, and frame-by-frame data lengths and data alternating with each other. The audio and video data are synchronized to each other at the leading end of each scene. Which byte of the audio data as counted from the leading end should lead the scene is instructed at the time of editing the data, and usually it is the first byte.
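
The per-frame audio figure quoted above can be checked with a short calculation. The rate of 30 frames per second used here is an assumption; the description gives only the 8,000 bytes per second and the approximate 267 bytes per frame, which are consistent with that rate.

```python
# Worked arithmetic for the audio data length per frame.
AUDIO_BYTES_PER_SECOND = 8_000   # from the description
ASSUMED_FRAMES_PER_SECOND = 30   # assumption: not stated in the description


def audio_bytes_per_frame(bytes_per_second=AUDIO_BYTES_PER_SECOND,
                          frames_per_second=ASSUMED_FRAMES_PER_SECOND):
    """Average number of audio bytes that accompany one video frame."""
    return bytes_per_second / frames_per_second
```

With these values the function returns about 266.7, which rounds to the 267 bytes per frame given in the description.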

The audio and video data each having a particular format as stated above and recorded in a CD-ROM are searched by an index file having a hierarchical scene-shot-frame structure. An index file applicable to this embodiment will be described hereinafter.

As shown in FIG. 6, the index file is made up of a data length representative of the size of the file, a title given to the entire file, the number of scenes, and scenes. Each scene has a scene number representative of the position of the scene as counted from the leading end, a scene title, the number of shots included in the scene, audio data used, a title of the audio data, a spare field for extension, and shots. Each shot has a shot number representative of a position of the shot as counted from the leading end of the scene, an address, video data used, a title of the video data, the number of frames included in the shot, and a spare field available for extension. The address is a block address of a CD-ROM and shows the position of the leading end of the shot in terms of block as counted from the leading end of the data file. The leading end of the shot data is coincident with the leading end of a block of a CD-ROM, so that the data may be accessed randomly at the time of playback. For this purpose, each shot is selected to be an integral multiple of a block (2,048 bytes).
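
The block alignment described above may be sketched as follows. The 2,048-byte block size and the rule that each shot is an integral multiple of a block come from the description; the function names and the sample payload lengths are hypothetical.

```python
# Minimal sketch: size of the dummy area needed to pad a shot to a block
# boundary, and the resulting block address of each shot as counted from
# the leading end of the data file. Names are hypothetical.

BLOCK_SIZE = 2048  # bytes per CD-ROM block, from the description


def dummy_length(payload_length, block_size=BLOCK_SIZE):
    """Bytes of dummy data needed so the shot ends on a block boundary."""
    return (-payload_length) % block_size


def shot_block_addresses(payload_lengths, block_size=BLOCK_SIZE):
    """Block address of each shot, given the unpadded length of each shot."""
    addresses = []
    block = 0
    for length in payload_lengths:
        addresses.append(block)
        padded = length + dummy_length(length, block_size)
        block += padded // block_size
    return addresses
```

For example, shots of 5,000, 2,048 and 100 bytes would receive 1,144, 0 and 1,948 bytes of dummy data and would start at blocks 0, 3 and 4, respectively.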

To record the data file and index file particular to this embodiment as well as to produce and edit them, it is necessary to determine beforehand the filing formats of audio and video data. In the illustrative embodiment, therefore, a predetermined audio and video data file and a CD-ROM file are prepared beforehand, given data is read out of such a file and then edited on the basis of the ROM file, and the edited data is written to the CD-ROM.

Referring to FIG. 7, a data file representative of an alternative embodiment of the present invention will be described. As shown, the data file has one or more chapters each comprising one or more scenes. Each scene has one or more clips each being made up of a sequence of frames and dummy data provided at the trailing end of the frame sequence. As shown in FIG. 8, the dummy data allows the leading end of its associated clip and, therefore, the leading end of the next clip to coincide with the leading end of one sector of a CD-ROM without fail. Should the leading end of a clip begin at the middle of a sector of a CD-ROM, detecting it would be difficult since data stored in a CD-ROM is read out on a sector basis. Each frame has a start code at the beginning thereof, and audio and video data in the form of combinations of data types, data lengths and data. Of course, the data may also include text data or similar data, and its kind is recorded in the data type. The data type may be provided with exactly the same format as the format shown in FIG. 3. If desired, among the eight data bits b₇ to b₀, the bits b₇ and b₆ may represent the kind of data, the bit b₁ may be a spare bit, the bit b₀ may be a link bit, and the bits b₅ to b₂ may represent a channel number.
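
A frame built from a start code followed by combinations of data type, data length and data may be sketched as below, using the optional bit assignment just described (b₇ and b₆ for the kind, b₅ to b₂ for the channel number, b₁ spare, b₀ for the link bit). The start code value, the two-byte big-endian width of the data length, and all names are assumptions made for illustration; the description does not specify them.

```python
import struct

# Minimal sketch of writing and reading one frame. START_CODE and
# LENGTH_FORMAT are hypothetical; the description does not give them.
START_CODE = b"\xff\xff\xff\xff"  # assumption: placeholder value
LENGTH_FORMAT = ">H"              # assumption: 16-bit big-endian data length


def build_frame(records):
    """records: list of (kind, channel, data). Link bit is 1 on all but the last."""
    out = bytearray(START_CODE)
    for i, (kind, channel, data) in enumerate(records):
        link = 1 if i < len(records) - 1 else 0
        out.append(((kind & 0b11) << 6) | ((channel & 0b1111) << 2) | link)
        out += struct.pack(LENGTH_FORMAT, len(data))
        out += data
    return bytes(out)


def parse_frame(buf, offset=0):
    """Return (records, next_offset) for the frame starting at offset."""
    if buf[offset:offset + len(START_CODE)] != START_CODE:
        raise ValueError("start code not found")
    pos = offset + len(START_CODE)
    records = []
    while True:
        type_byte = buf[pos]
        kind = (type_byte >> 6) & 0b11
        channel = (type_byte >> 2) & 0b1111
        link = type_byte & 0b1
        (length,) = struct.unpack_from(LENGTH_FORMAT, buf, pos + 1)
        pos += 1 + struct.calcsize(LENGTH_FORMAT)
        records.append((kind, channel, buf[pos:pos + length]))
        pos += length
        if not link:  # link bit 0: data of this frame ends here
            return records, pos
```

Because every record carries its own type and length, data of different kinds and variable lengths can share a single frame, and the link bit tells the reader when the frame is complete.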

As shown in FIG. 9, in the video data, each frame has a header and block data whose length is variable. The header is made up of a frame number, a still/moving picture type, a frame size representative of the number of pixels counted in the horizontal direction of the frame, a frame size representative of the number of pixels in the vertical direction of the frame, and quantizing tables respectively assigned to a Y and a C signal.

As shown in FIG. 10, an index file of the illustrative embodiment has a root index and a chapter index which is made up of chapters 1 to N. Each chapter, e.g., chapter 1 has the number of scenes (e.g. M scenes), the number of clips constituting each scene, and a clip index. Each clip index, e.g., clip index 1 has an absolute address, the number of sectors, an attribute, and a reservation field. The attribute is representative of the kind of data constituting the clip. As shown in FIG. 11, the root index has chapter addresses each being assigned to a respective one of the chapters and the numbers of sectors occupied by the individual chapters.
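
The hierarchical index of FIGS. 10 and 11 may be modelled as in the following sketch. The field names mirror the description, but the class names, the flat ordering of clip indexes by scene, and the use of zero-based numbering are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List

# Minimal sketch of the chapter / scene / clip index hierarchy.
# Class names and the storage order of clip indexes are hypothetical.


@dataclass
class ClipIndex:
    absolute_address: int  # position of the clip on the medium
    sector_count: int      # number of sectors occupied by the clip
    attribute: int         # kind of data constituting the clip
    reserved: int = 0      # reservation field


@dataclass
class ChapterIndex:
    clips_per_scene: List[int]    # number of clips in each scene
    clip_indexes: List[ClipIndex]  # assumption: stored flat, in scene order

    def find_clip(self, scene, clip):
        """Look up a clip index by zero-based scene and clip numbers."""
        if not 0 <= clip < self.clips_per_scene[scene]:
            raise IndexError("clip number out of range for this scene")
        return self.clip_indexes[sum(self.clips_per_scene[:scene]) + clip]


@dataclass
class RootIndexEntry:
    chapter_address: int  # address assigned to the chapter
    sector_count: int     # number of sectors occupied by the chapter
```

A search would first select a chapter through the root index, then a scene and a clip through the chapter index, and finally read the clip from its absolute address for the stated number of sectors.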

In summary, a data recording system of the present invention having a data file and an index file each having a unique format as described above achieves various unprecedented advantages, as enumerated below.

(1) The index file having a hierarchical format facilitates the management of data.

(2) Different kinds of data each having a variable length can be recorded in a single frame by use of data types and data lengths.

(3) A particular code (start code) leading each frame implements the recovery from data read errors at the time of playback.

(4) A shot or a clip which is the minimum unit for access coincides at the leading end thereof with the leading end of a sector of a recording medium without fail, so that the leading end of a shot or that of a clip can be detected with ease.

(5) Audio and video data are linked on a frame basis and, therefore, can be readily synchronized at the time of playback.

(6) With a link bit which is constituted by the last bit of a data type, it is possible to determine whether or not data being read out is to be followed by another type of data within the same frame.

Various modifications will become possible for those skilled in the art after receiving the teachings of the present disclosure without departing from the scope thereof. 

What is claimed is:
 1. A method of digitally recording a plurality of data groups, each of which has composite data segments, on a recording medium, comprising the steps of: providing a recording medium having a plurality of physical segments, each of which has a leading end; and recording on said recording medium said plurality of data groups, wherein each of said composite data segments has (1) plural segments of compressed motion picture data, compressed by efficient coding, with a data length variable on a unit time basis, and (2) corresponding audio data divided by time division at predetermined time intervals so that the divided audio data are synchronous with a motion picture represented by the compressed motion picture data; wherein one of said plurality of data groups extends over more than one of said plurality of said physical segments; wherein a leading end of each of said plurality of data groups coincides with the leading end of a respective one of said plurality of physical segments; and wherein each of said plurality of data groups represents a shot or clip of a scene of said motion picture.
 2. The method as claimed in claim 1, further comprising the step of recording header information at a head of a frame of said compressed motion picture data or at a head of a field of data and at a head of said divided audio data, said header information including at least information representative of a characteristic of the corresponding data.
 3. The method as claimed in claim 1, wherein said recording medium has a physical division unit having a predetermined number of bytes and coinciding with a head of said frame of a shot or a clip.
 4. The method as claimed in claim 1, further comprising the step of providing an index recording portion for storing at least information representative of a position on said recording medium where the leading end of said each group and the leading end of said one physical segment coincide.
 5. The method as claimed in claim 1 wherein said each data group is headed by information representative of the length of the group.
 6. The method as claimed in claim 1 wherein each data group includes an idle region at a trailing end thereof such that said each group has a length which is an integral multiple of said physical segments. 